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I. INFRODUCTION 


A. BACKGROUND 

Extensive work has been accomplished recently in the area of modeling and 
predicting quarterly overhead costs for aircraft manufacturers. Overhead costs are 
generally predicted utilizing estimated overhead rates applied to labor hours or costs 
over functional categories such as manufacturing or engineering. Changes in operating 
rates cause overhead rate changes which may be observed only after a significant lag in 
time. Recent work done in this area has been to estimate total overhead as a function 
of the number of direct manufacturing personnel [Ref. 1]. 

Data collected for purposes of estimating overhead costs, since it is of a time 
series nature, can be expected to exhibit some degree of autocorrelation. Specifically, 
data collected for recent purposes has exhibited first order autoregressive AR(1), fourth 
order autoregressive AR(4), or a combination of these processes. An AR(1) process 
occurs when the errors in adjacent time periods are related. This type of relationship 
would be expected in yearly cost and operating data. A special form of the fourth 
order autoregressive would be expected in data of a seasonal nature, 1.e. quarterly 
observations. Errors in this special case of the AR(4) process would be related to 
errors Which occured four quarters previously. 

The method of ordinary least squares (OLS) 1s used to regress the data and 
estimate overhead costs. Ordinary least squares’ validity is based on certain key 
assumptions: 


e Errors are distributed independently of the explanatory variables with zero mean 
and constant variance. 


e Successive errors are independent of each other. 
With autocorrelation present, assumption two 1s violated and the ordinary least 
Squares procedure breaks down at three points: 
e Estimates of regression coefficients, though unbiased are not efficient. 


e The usual formula for the variance ofan estimate no longer applies and 1s liable 
to seriously underestimate the true variance. 


e reap) distributions, used for making confidence statements are no longer valid 
aie AE 


Since the OLS procedure breaks-down with the presence of autocorrelation, it 1s 


important that its presence be detected and the disturbance in the data subsequently 


corrected. Without accounting for these disturbances in the data, estimates of the 
regression coefficients will not have minimum variance. 

The Durbin-Watson statistic is utilized to detect the presence of AR(1) behavior. 
Additionally, it has been generalized to detect higher order autoregressive processes. 
Durbin and Watson’s work was based on the findings of T.W. Anderson which showed 


that the statistic 


e Acyee (eqn 1.1) 


where e is the column vector of residuals from a regression and A 1s a certain real 
symmetric matrix, provides a test that is uniformly most powerful against certain 
alternative hypotheses [Ref. 2]. 


The general model upon which Durbin and Watson’s work was based 1s 


Y See (eqn 1.2) 
where 
En OFee ate te t= Loe T (eqn 1.3) 


where X, is a (1 x K) non-stochastic matrix, B is a (K x 1) vector and i is one for an 
AR(1) process and four for an AR(4) process or a mixture of both processes. The 


error component, €,, 1s specified by the form above and n, is distributed with zero 


t? 
mean and constant variance. 

Typically an ordinary least squares regression is performed on_ selected 
independent variables. If the presence of autocorrelation is suspected, the data may be 
tested for a particular AR process utilizing the appropriate form of the Durbin-Watson 


Statistic. For an AR(1) process the form of the Durbin-Watson statistic is 


Z 24 2 
d)=d(e,-e..)7/ d (eqn 1.4) 
vrZ és/ 


where 


(eqn 1.5) 


Vas ~ 
Y, = X,B (eqn 1.6) 


The estimator of B from the ordinary least squares regression is p The null hypothesis 
for the test is that of zero autocorrelation in the residuals against an alternative 
hvpothesis that a first order autoregressive process exists. 

If the presence of an autoregressive process is detected in the regression residuals, 
the linear model must be reestimated after transforming the original data. For an 


AR(1) process, the independent variables are transformed as 


ee loos cel (eqn 1.7) 
and 
* 
Sg Mamie f= 2)2 18 (eqn 1.8) 


In a similar fashion, the dependent variables are transformed as 


D3 
Me -Mdlepy)'? t= (eqn 1.9) 
and 
** , 
Ee 1-7) t= 2. (eqn 1.10) 


In order to accomplish this transformation an estimate for Pp, 1s required. 
Though this parameter can be estimated in several ways, the most popular method due 


to simplicity of calculation is 
py Poeun (eqn 1.11) 


Where d, is given in equation 1.4. In addition to the ease of calculation, this estimator 
performs as well as more complex estimators which are available {Ref. 3]. 

After the data is transformed, generalized least squares(GLS) 1s used to 
reestimate model parameters. After GLS is performed, the model 1s rechecked for the 


presence of autocorrelation using the Durbin- Watson statistic. 


B. — FOCUS AND SCOPE OF RESEAKGi 

As previously mentioned, regression data is at times influenced by multiple 
autoregressive processes. In overhead cost estimating, both an AR(1l) and AR(4) 
process have been observed to simultaneously influence certain data sets [Ref. 1]. In 
the linear least squares model 


ae Ete (ediamina 


t 


suppose the errors are serially correlated and of the form 


&. = O16.) eae eee (eq iaiG) 
where the effects of the second and third prior quarters are negligible compared to the 
effect of the most recent and year-earlier quarters. The n,’s in the model, as in the 
previous version, are independent and distributed with zero mean and constant 
Variance. 

It is advised [Ref. 4: p. 211] that if this special form of the AR(4) process exists, 
that it be tested for by a two-step procedure. In this procedure, after initially 
performing a OLS regression, the residuals are tested for an AR(1) disturbance utilizing 
the Durbin-Watson statistic for a first order process. If the influence of an AR(1) 
process 1s detected, a value for p; is estimated and the original data is transformed. 
GLS is then performed on the transformed data and the residuals are tested for the 
influence of a fourth order autoregressive process by means of Wallis’ test [Ref. 5S]. If 
an AR(4) process is present, a value for Pg is estimated and the data is transformed a 
second time. At this point, the procedure is repeated to determine if either form of 
autocorrelation is still present. 

Unless the individual performing the test has a priori knowledge that both an 
AR(1) and AR(4) process exist within the data, it is very likely that one or the other 
would be overlooked. Additionally, peculiarities in the data may preclude the detection 
of one or the other processes. If this occurred, the GLS regression estimates as 
previously discussed would be inefficient. In order to avoid the problem above, it 
would be desirable to have a procedure to detect simultaneously both an AR(1) and 
AR(4) process if they existed in the data. This process, which is a special case of an 
AR(4) process shall be specifically referred to as an AR(1,4) process. In addition, if 


the AR(1,4) process exists then there must be a way to transform the data and 


10 


calculate estimates for the parameters of the process. This proposed procedure for the 
AR(1,4) process would prevent oversights of existing processes and would save time 
expended in having to correct for the AR(1I) process and then the AR(4) process. 
Most importantly though it would insure that if the AR(1,4) process was present in the 
data, it would be corrected. In this way any peculiarities in the data which might 
prevent detection and correction of either the AR(I) or AR(4) process would be 
avoided. 

As previously discussed, with an autoregressive disturbance present in the data, 
the OLS assumption of independent error terms is violated. This is related to problems 


in the error covariance matrix which is denoted as 
E(ee’) = oy, (eqn 1.14) 


where e is a vector of regression residuals. The problem in the error covariance matrix 
is that there exist off diagonal terms not equal to zero. This condition can indicate 
that some form of autocorrelation exists. An additional problem in the error 
covariance matrix is that the diagonal elements are not equal. This condition indicates 
that the error terms are not distributed with a constant variance. Errors distributed in 
this manner cause estimates of regression coefficients to be inefficient though unbiased. 


If the error covariance matrix equals 


o7y (eqn 1.15) 
instead of 
67] (eqn 1.16) 


A 
where I is an identity matrix, the regression coefficient B is correctly defined by 
xy lxy ixvycly, (eqn 1.17) 


A 
The general procedure [Ref. 6: p. 440] used to attain an efficient estimate for B is: 
¢ Find a matrix P such that P’P = wl. 


e Using the matrix P, transform the original data set where 


1] 


Yo i2X, (eqn 1.18) 


SC ies teqn 1.19) 


e Perform GLS on the transformed model 


Y"=X Pte (eqn 1.20) 
where 

* 

e = Pe (eqn 1a 


to estimate the regression coefficients where 


Aw , 
R=(X kop Vo (eqn 1.22) 


C. RESEARCH GUESRIONS 
Following the procedure as outlined above, this paper will address the AR(1,4) 
process. Specifically, this thesis will attempt to develop: 
e A statistic capable of detecting the AR(1,4) disturbance. 
e A P matrix for a transformation to account for the AR(1,4) disturbance. 


e A method to estimate parameters required for the transformation. 


D. RESEARCH METHODOLOGY 

In reaching the first objective, the statistic will be calculated by estimating the 
distribution of the ratio of two quadratic forms in normal variables. This method 1s 
based on Imhoff's [Ref. 7] technique and is outlined by Koerts and Abrahamse [Ref. 8]. 
These values shall be calculated for an extension of Schmidt’s statistic [Ref. 9] to the 
AR(1,4) process. This statistic was developed in a manner similar to Durbin and 
Watson's and 1s essentially based on their work. The power of this statistic to detect 


the AR(1,4) process will be compared to the current sequential method of testing. 
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To account for the autoregressive disturbance, it is required that there be a P 


matrix capable of transforming the data with the property that 
PP = wl, (eqn 1.23) 


The inverse of the yw matrix for the fourth order process was derived by Siddiqui 
[Ref. 10]. With the form of the y inverse matrix known, a P matrix with the above 
stated property can be found using a method proposed by Beach and MacKinnon 
[Ref. 11]. Both the wy inverse matrix and P matrix are expressed in terms of 0's or 


autoregressive parameters from the transfer function 
eee eres ts c4' 24 ‘ly (eqn) 


Utilizing the Yule-Walker equations [Ref. 12: p. 55], the autoregressive parameters 
from this transfer function were expressed in terms of the autocorrelation coefficients 
or p’s. Once the 0’s were determined in terms of the p’s, substitutions were made into 
the derived P matrix. The resulting P matrix can be used to transform an AR(4) 
process as represented by the error term above. Since this error term is not 
representative of the AR(1,4) model, corrections were made to the P matrix by setting 
appropriate values of p equal to zero. For Schmidt’s statistic, which will be introduced 
in Chapter II, p> and p3 were set equal to zero. The resulting form of the P matrix 1s 
that required to transform data with an AR(1,4) disturbance. 

With the P matrix defined, the next task is to find estimates for p,; and py. Since 
the method outlined above is generalized least squares, one approach for estimating the 


values of p, and Py is by the sample correlation coefficients 
age », Sct e” s=1,4 (eqn 1.25) 


where e, is a vector of least squares residuals. Since estimators derived in this manner 
are subject to considerable small-sample bias, the approach which will be developed 
will be to estimate p,; and Py by direct utilization of the test statistic. This method 1S 


analogous to Durbin and Watson’s method where 


fips 1-54, (eqn 1.26) 
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and d, is the Durbin-Watson statistic for an AR(1) process. 

After developing the above outlined procedure and generating the bounds for 
Schmidt’s statistic a major question which must be answered 1s: How well do the 
procedures work in comparison to each other? In order to make the comparison, 
simulations will be used which utilize least squares residuals generated for various 


values of p; and py using a formula for the error of 


ay = Py et-] + Pat4 - n, (eqn er) 


Where , is distributed normal with zero mean and specified variance t, With 
residuals calculated as such, the Schmidt, Durbin-Watson and Wallis statistics will be 
calculated. In addition to the Durbin-Watson statistic, after appropriately 
transforming the residuals, the Wallis statistic will be used in the two-step method to 
test for the presence of an AR(4) disturbance. For various sets of residuals generated 
with different variances for the normal error term, the power of the statistics to detect 
the AR(1,4) disturbance will be determined. Additionally, the number of observations, 
T, will be varied by varying the number of residual terms. Thus the power of the 
various Statistics to detect the AR(1,4) process will be determined for different values 
of T, p, and py and t”. Residuals used in the above calculations will also be used to 
calculate estimates of p, and py. Utilizing mean square error, calculated from 
simulation data, the efficiency of derived formulas to estimate p, and py will be 
determined. Specifically, the ability of both procedures to detect the AR(1,4) 


disturbance will be determined for a best case with T= 100 and 1 = 01 anda worst 


case where T= 20 and t* = 10. 

The ability of the entire procedure derived for Schmidt’s statistic in estimating the 
regression coefficients will be determined for a best and worst case scenario. The best 
case will be for t? = .01 and T = 100, while the worst case will be for <> = 10andT 


= 20. Using the regression model 


Ye Dee (eqn 1.28) 
where 
fe = Pyfr_1 + Prag + Mp (eqn 1.29) 


for X, = .l and B = | the Y,’s are generated for various values of py and py. Once 
the Y,’s are determined, an estimate of B will be calculated. Mean square error will be 
used as a measure to compare estimates of B with their known value of one. This 
measure will indicate the efficiency of regression parameters estimated by the procedure 


as based on Schmidt’s statistic. 


EF. THESIS ORGANIZATION 

The thesis 1s organized into four chapters. Chapter I] will introduce background 
theory utilized to derive the AR(1,4) procedure. In particular, the Durbin-Watson test 
for the AR(1) process will be addressed along with the Wallis and Schmidt tests for 
special cases of the AR(4) process. In addition to test procedures, the transformation 
for autocorrelated data will be discussed. 

Chapter II] will initially develop the test for the AR(1,4) process. Specifically, 
upper and lower bounds will be developed for the Boger and Schmidt statistics. After 
deriving the test, the data transformation for the AR(1,4) process will be addressed. 
Next, results will be presented concerning the effectiveness of the derived results for the 


AR(1,4) process. Finally, conclusions and areas of future study will be discussed. 


Il. BACKGROUND THEORY 


A. GENERAL 

In this section certain concepts underlying all time series processes will be 
defined. These definitions will lay the groundwork which will lead into discussion of 
specific test procedures. In particular, the Durbin-Watson, Wallis and Schmidt tests 
shall be discussed. Discussion of these procedures will act as a prelude to the next 
chapter’s development of a test and correction procedure utilizing an extension of 
Schmidt's statistic. 

A sample of error residuals e),....e7, where the index denotes a point in time, is 
called a time series. Each observation in this sample is a realization of a random 
variable and as a result this sequence is considered to be a discrete stochastic process. 
The sequence is considered discrete since observations are made at a fixed interval of 
time. Additionally, ereeen is considered as a finite subsequence of an infinite series 
whose joint distribution is defined by these finite subsequences. 

Only stationary stochastic processes will be considered in this paper. A process is 
considered stationary if the joint probability distribution of the residuals e1,....e7 is 
unaffected by shifting the time origin forward or backward by k time units. That ts, 
Cp €7 and Cp 4 poer€e 4 T, are identically distributed [Ref. 12]. 


The general linear model with autoregressive disturbance of order p is defined as 


Ye = 2 pata. (eqn 2.1) 
with 
f= 016. 8 5Et-p +n, (eqm2.2) 


where Y, is the value of the dependent variable at time t, X,’ is a (1 x K) vector 
representing an observation on K non-stochastic variables at time t and B is a (K x 1) 
vector of coefficients to be estimated. Equation 2.2 implies that error disturbances in 
the current time period depend on those in previous time periods and another error 


term which is independent with zero mean and constant variance. 
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Autocovariance defines the linear relationship between members of a stochastic 


sequence. In particular, the autocovariance, ¥,, between €, and & 4, is given by: 


Ye = El(e,-Ele,)e, 4 -E(E,44))) k= 0,1,2,.. (eqn 2.3) 


where E(€,) is the mean of the stochastic process, which is zero in our application. For 
a stationary process the covariance between €, and €, +}; depends on k and not on the 
particular point in time. The above formula completely describes the autocovariance 
Structure of the stochastic sequence as defined above [Ref. 13: p.226]. 

Autocovariance 1s dependent on the unit of measurement for the underlying 
variable. To account for this, the y,'s are normalized by dividing by Yo, which is the 


variance of €,. Dividing 7, by Yo results in the autocorrelation function of €;: 


ie Sey  *K— 0, 1,2... (eqn 2.4) 


From the above definition of the autocorrelation function it 1s obvious that 


Py = 1. (eqn 2.5) 


In order to gain estimates for the 8’s or autoregressive parameters in terms of the 
autocorrelation coefficients or p’s, the Yule-Walker equation is used [Ref. 12: p. 55]. 
This equation for the above general linear model is derived by left multiplying the 


equation 


€,=O)e,.)+...+ 0,8, ,+ nt (eqn 2.6) 


by €,.,, and then taking expectations and dividing by the variance Yo of €,. This 
results in the Yule-Walker equation 


p, = Cpe +: + OPK p fork = 12he. (eqn 2.7) 


By substituting k=1,2,...p into the above equation, a set of linear equations for 
0),0>...0,, in terms Of Pj .Po---Pp are obtained. By substituting estimates for Py -P2---Pp 


into the above equations, the autoregressive parameters are estimated. Various other 


methods exist to estimate the p’s, and some will be discussed later. This result is 
important in the specification of the autoregressive process and will help in the 


development of a transformation matrix for the autocorrelated data [Ref. 12]. 


B. DURBIN-WATSON TEST 

This section summarizes the most important features of the theory for the 
Durbin-Watson test [Ref. 2,14,15]. This theory is the basis for later generalizations of 
tests for autoregressive disturbances. 


The Durbin-Watson test is based on the statistic 


A ae 
d= Y(e,-€;.7) Dag = e’Ac/e’e (eqn 2.8) 
ay t?/ 


where e = y - Xf is a vector of least squares residuals and A equals: 


1 -l QO 0" 0.005500 0 
=] 2 7) SOR 0 Oe ee ee 0 
O--l, 2 =1> 0 30> 902 RO aso 0 


For certain cases where regression vectors are eigenvectors of the matrix A 


occurring in the residuals distribution, the statistic 
GeAce mee (eq 2.4) 
provides a test which is uniformly most powerful against certain alternative hypotheses 


(Ref. 16: p. 88}. 


The general linear model was previously defined as 


YS uiPackc, (eqn 2.10) 


where Y 1s a (T xX 1) matrix, X 1s a (T x K) matrix of T observations on K variables 
and € is a (T x 1) matnx of errors. The least squares estimate of B is B which is given 


by: 

MN ae 

Bp = (X’X)°X’y. (eqn 2.11) 

Also, e, the vector of residuals from the regression, is defined as 
A 

e= y-XP = My (eqn 2.12) 
where I is an identity matrix of order T, and 

ele): (eqn 2.13) 


It can be verified that the matrix M is idempotent. Substituting XB + e for y into the 


formula for e leads to the result 

e= Me. (eqn 2.14) 
This result implies that 

d = eMAMe/ e’Me. (edn? as) 


The distribution of the above statistic is dependent on the (T - K) non-zero eigenvalues 
of AM which are denoted by 6;. Values for 6; are dependent on M through the matrix 
X. 

The matrix A is a (T x T) symmetric matrix with (T - 1) positive eigenvalues 
which for T greater than or equal to three lie in the closed interval from zero to four. 


The eigenvalues of A will be denoted by i and are defined as 


h =e le=cos(y ie | = La, I-1. (eqn 2.16) 


i, 


Durbin and Watson showed that if the e, are assumed to be normally distributed, 


there exists an orthogonal linear transformation of e to v such that 


T-K 7-X 
d = Yi? / Yv,’, (eqn 2.17) 
Cs/ 


afl 


where v; are independent and identically distributed normal variables with zero mean 
and variance 6%. The distribution of d thus depends on the 6; which are the 
eigenvalues of M [Ref. 2: p. 412]. 

Using the above result to obtain the distribution of d is tedious, since the 0,’s 
depend on the X matrix and the statistic would have to be calculated for each new 
matrix. Consequently, if it is desired to have a test which is not restricted to a 
particular X matrix, the distribution would have to be determined for all X matrices. 
This task 1s impossible since there are an infinite number of matrices. 

Durbin and Watson avoided this problem by constructing a bounds test where 
the upper and lower bounds, dj; and d, , respectively, are independent of the particular 
X matrix. They were able to accomplish this by determining inequalities on the }: in 
terms of the eigenvalues h: of the matrix A, where the 6.’s and i.’s have been arranged 


in increasing order. This result implies that 


dj SdSdy (eqn 2.18) 
where 
T-K T-k 
“Ue De hit KY : [dv So (eqn 2.19) 
(=/ (a/ 
and 
7-K 5 7-K 5 
dy = LAY; / Ne Vv i (eqn 222) 
(x / ¢4a/ 


Durbin and Watson were unable to find the exact distribution of these bounds but 
approximated the distribution with the first four terms of a series expansion in terms of 
Jacobi polynomials using a Beta distribution as a weight function. 


Using the statistic 


Z 2h 
RE Fe | ee (eqn 2.21) 
{225 t=/ 
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and asymptotic results it can be shown that 
d= 2(1-p), (eqn 2922) 


where fp is an estimate of the autocorrelation coefficient. This result shows that when 
p equals zero, indicating no autocorrelation, d equals two. In addition, when p equals 
one indicating positive autocorrelation, d equals zero and when p equals negative one, 
d equals four. In testing the null hypothesis of no autocorrelation against the 
alternative hypothesis of positive autocorrelation, the null hypothesis would be rejected 
if the calculated value for d is less than the lower bound calculated and accepted if d is 
greater than the upper bound. If the calculated value of d falls between the upper and 
lower bounds the test is considered inconclusive and there is insufficient evidence to 
accept or reject the null hypothesis. 

Since Durbin and Watson first calculated upper and lower bounds for their 
statistic using an approximate distribution, a method has been developed by Imhoff 
Which allows the calculation of an exact distribution for a quadratic form in normal 
variables [Ref. 7: p. 419}. To briefly indicate how the Imhoff distribution is utilized in 
the calculation of the statistic’s bounds, only the lower bound will be considered. The 
calculation of the upper bound can be done in an analogous manner. In order to 
calculate the lower bound, the eigenvalues of the A matrix, hi, are calculated. For the 
A matrix, there will be (T - 1) non-zero eigenvalues where T is the order of the A 
matrix and the one zero term is the eigenvalue which corresponds to the constant term 
of the regression. From these eigenvalues, the (T - K) smallest are selected to compute 
the statistic’s lower bound [Ref. 8: p.70]. The value for K is equal to the number of 
regressors including a constant term. The basis for the above procedure 1s the result 
proved by Durbin and Watson which determined inequalities of the eigenvalues oF of 


the matrix AM in terms of the eigenvalues A, of the A matrix or 


Nee hey K i = lee. THK - (eqn 2) 
where 
R= Kel (eqn 2.24) 


ot 


and K’ is the number of regressors not including a constant term. Analogously, to 
calculate the upper bound the (T - K) largest eigenvalues are selected. Adapting the 
Imhoff distribution to this problem leads to 


F(d; ) = 1/2 - 1/nf (( sin @(u)) / (u@(u))) du (eqn 2.25) 
) 
where 
7-K 
p(u) = .5 » tan !(u.u) (eqn 2.26) 
is] 
w@(u) = Tit a5 u2u2yl/4 (eqn 2:27) 
V7 
and 
Wi Ase dr (i = Tee T-K) (eqn 2.28) 


Since the h.’s are known, a value for dy is selected and the u,’s are calculated. 
Using the u,’s, the integral is then evaluated numerically. Numerical integration of this 
infinite integral 1s possible since Imhoff has proved that the integral’s limit as u 
approaches Zero 1s 

7-K 

1/2) us. (eam: 2-29) 
raf 


Since the integral’s range is infinite, there will be a truncation error associated with this 
integration. Imhoff has shown that the truncation error caused by integrating over the 


finite range from 0 to u inclusive, can be held to any arbitrary level by taking 
77K 2 g 
u = (((T - K)/2) mpl |u,|!/2 ) (eqn 2.30) 
ey] 
where g = 2/(T-K). Additionally, the numerical integration will be subject to error 


related to the method of integration. This error is controlled by setting the error 


tolerance within the program used to integrate the function. 
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C. EXTENSIONS TO HIGHER ORDER PROCESSES 

Durbin and Watson’s original work has been extended to higher order 
autoregressive processes. Specifically, Wallis has extended it to a special case of an 
AR(4) process and Schmidt has extended it to a second order process. For the 


regression model 
we= XB + e, (eqn 2.31) 
with an error specification 


E = pe, 4g + ny, (eqn 2.32) 


Wallis generated bounds for the statistic 
7 5 7 on 
ay = GE, -€,_4) re. = € Age | ete. (eqn 2.33) 
as t2¢ 


Wallis’ derivation of this result is analogous to Durbin and Watson’s. Using 
Imhoffs distribution, Wallis was able to calculate bounds for this special case of the 
AR(4) process [Ref. 5: p. 617]. Vinod, in a manner similar to Wallis, further extended 
these results to include bounds for the same special cases of the AR(2) and AR(3) 
processes [Ref. 17]. 

In order to account for an AR(1) or AR(4) disturbance, the data must be 
transformed. In this regard, for the AR(1) process the dependent variables are 


transformed as 


, * 
Y, =Y(i-p/2, t= (eqn 2.34) 
and 
* 
be = 1G : upoesle Tom 2S, pees As (eqn 25) 


Similarly, the independent variables are transformed as: 


m= Xa apy), t= 1 (eqn 2.36) 


t 
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and 
* 
Od. CAG ar t= 2,3; (eqn 2.37) 
In order to utilize this transformation, an estimate for p, is required. Several 
methods are available to estimate this parameter. The method of choice due to its 
favorable properties and ease of computation [Ref. 3] is 


py = 1- Sd). (eqn 2.38) 


For the special case of the AR(4) process the transformation is_ similar. 


Specifically, the dependent variables are transformed as: 


k 
Y, =Y,(i-pq)/2 t= 1,2,3,4 (eqn 2.39) 
and 
* 
SEMIN 2 Sha (eqn 2.40) 


The independent variables, similarly are transformed as: 


x, =) Xi(1 294g tele (eqn 2.41) 
and 
* 
Xo: = Xiph  e (eqn 2.42) 


As in the AR(1) case, an estimate of pg is required. The formula for this estimate is 
the same as before except that the Wallis statistic is used in lieu of the Durbin-Watson. 
Once either of these transformations is utilized after the detection of its respective 
disturbance, estimates of regression parameters can be expected to be efficient and 
unbiased. 

Schmidt [Ref. 9] generalized Durbin and Watson’s preliminary work to the case 


ofa second order autoregressive process whose disturbance is of the form: 
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E, = Pye + Poko + Nye (eqn 2.43) 


The statistic which Schmidt derived to detect this disturbance was: 


where 
e SSG 
re iG Cope as (eqn 2.45) 
422 ¢*/ 
and 
4 RD 
5 = rm - e+.) [de ; (eqn 2.46) 
3 i) 


In matrix notation this statistic is represented as 
dy = e(A, + Avje/ee (eqn 2.47) 


where A, is the matrix as previously defined for the Durbin-Watson statistic and A> 1s 


defined as 


POO 0 0 0 0 OO 0 
tae) O° [UO CO 0 
eee eet KO - OO 0. 0 v 
Viewer 20, -l 0 O 0 0 
-1 O 0 -l 

l 0 

(rales 0 | 
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Following results proved by Durbin and Watson, Schmidt generated the statistic, d5, 
utilizing _Imhoffs distribution and the eigenvalues from the matrix (A; + A>). Ina 


manner similar to previous developments, he further showed that 


dy = 2(2- py - Pp). (eqn 2.48) 


D. SEQUENTIAL TESTING 
In development of a generalization of the Durbin-Watson statistic, Vinod, 

derived a scheme of sequential tests for the presence of autocorrelation [Ref. 17]. In 
this procedure a sequence of tests is used to determine if AR processes of sequentially 
increasing order exist. As an example, the null hypotheses for the four tests required 
to detect a fourth order process would be: 

“lig: Py = 2, 

© Hg 7 P> = Uewenl pia), 

© Ho: pz = Ogiven p; = py = 0, 

© Ho: Pq = O given py = Po = P3 = 0. 
Thus in order to test for an autoregressive disturbance, the null hypotheses for 


sequentially greater orders are tested until one of the null hypotheses is rejected. 


h 


Specifically, at the iP step,} > 1, the hypothesis, Ho: p; = ( given that Pp /=ap = 


= Pj. = 0 1s tested against the two-sided alternative el aD and P; < 0. 


J 
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Ill. THE AR(1,4) PROCESS 


A. INTRODUCTORY ANALYSIS 

In this section stationary conditions and the development of the error term for 
the AR(1,4) process will be presented. If stationary conditions do not hold the 
AR(1,4) model is invalid and a model which considers a dynamic specification must be 


pursued. In particular, stationary conditions must be determined for the regression 
model 


es Pate fy (eqn 3.1) 


whose error term 1s given by: 


As discussed in previous chapters, the n,’s are assumed to be distributed normally with 
zero mean and constant variance. 

Wise [Ref. 18] derived a method to determine the stationary conditions for a 
stochastic process of the autoregressive and moving average types. Following his 
procedure, it is possible to show that the stationarity conditions for a fourth order 
autoregressive process are 

Poe | 2p, > 0, 

Gp py 0, 

D) 2-0, pean Py 0, 

over] - Po Pa ~ Pq > 9, 

© 1+ P)-P> + P3- Pg > 0. 
Since Pp» and p3 are assumed to be zero in the AR(1,4) process, these stationarity 
conditions reduce to 

eee Py Zia OY, 

oe en, 4%, 

Se Py a 20 ees 

eae Pao, 
1+ Pp, - Pg > 9. 
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The error process for a general fourth order process is defined by the relation: 
€,=9)6.; +05£, 5+ 0383+ 948, 4. (eqn 3553) 


If the relationship is multiplied through by €,, and expectations are taken, the 


difference equation 
Vq= O81 FO2V RF F Ogg kK 0 (eqn 3.4) 


results, Where Y was previously defined as the autocovariance. If this equation 1s 


divided through by Yp, the result is 


which 1s the general form of the Yule-Walker equation and defines the autoregressive 
parameters, 9’s, in terms of the autocorrelations, p’s. If k=1, 2, 3, 4, is substituted 
into this equation a set of linear equations is obtained for 0,, 0,, 0, and O4 in terms of 


Pi» Po, P3 and We Specifically, these equations are 


p= 9; +95) +859 + 8403, a 
and 
P4=9)P3+ Op. + O3p) + Og. (eqn 3.9) 


Since the error process for the AR(1,4) process is specified as 


£, =O. ,+ Oye, 4+ 0, (eqn 3.10) 
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where @, and 0, are assumed equal to zero, the above equations can be reduced to 


p> = O)p, + Bgpp, (eqn 3.12) 

p3 = O)p> + Ozp,, (eqn 3.13) 
and 

Py = O)p3 + Oy. (eqn 3.14) 


Further since it is assumed that p, and p3 are not present in the disturbance, the above 


equations can be finally reduced to 


p, = 9, (egmis. 15) 
and 
Py = Q,4. (eqn 3.16) 


These final equations allow the AR(1,4) error term to be represented as: 


E, = PpEry + Pg&i-g + Mh (eqn 3.17) 


B. BOGER’S STATISTIC 
Boger proposed a statistic which was a variation of the Durbin-Watson statistic 


to test for the AR(1,4) disturbance. This statistic is of the form 


A 2, Gd 
dia ae (e,-€,. 7-4)” / UG, (eqn .3ahs) 
625 ee 


In matrix notation this statistic 1s denoted as 
dy q=evAy ge / ee (eqn 3.19) 


where A, 4 equals 


0 0 1 -l O 9 0 0 
1 “0: [OSI iso 0 
Q. 1 oO Us eel 0 
0-0 To OS OReT= 0 
ee ee) Cie 0 Ue 
“lol 0 -=)h5, 2 cle ae? 





Bounds for Boger’s statistic could not be determined because its A matrix as 
defined above did not satisfy properties as established by Durbin and Watson. In 
particular, Durbin and Watson [Ref. 2] showed that the matrix product MA, where A 
is any real symmetric matrix has (T-K) real positive eigenvalues of which K are zero 
eigenvalues. T and K are the dimensions of the matrix X. Specifically, Durbin and 


Watson showed for their statistic whose A matrix equals 


lL -1 0.0 O09 3 5 9 ee 
-1 2-1 0 0 

0-1 2 -1 0 

0 

0 

0 

0 AE 
0 12-1 
| 0 Ha | 
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that there were T-1 positive eigenvalues. As previously discussed, they also showed 
that if the positive eigenvalues from both MA and A are placed in increasing order, 


bounds for statistics of the form 
dee AG / cc, (eqn 3.20) 


can be calculated. 

Where Boger’s statistic violated these properties was that for all values of K or 
the number of independent variables, there were not sufficient eigenvalues in the A 
matrix to bound those eigenvalues in the matrix MA. Specifically, the A matrix for 
Boger’s statistic had one zero and three negative eigenvalues where Durbin and 
Watson's had one zero eigenvalue. Since the matrix MA had T-K positive eigenvalues 
and Boger’s A matrix only had T-4 positive eigenvalues instead of T-1, when the 
number of independent variables was less than or equal to three, there were not 
sufficient eigenvalues from Boger’s A matrix to bound those from the matrix MA. 
Since this violated Durbin and Watson’s theory, upper and lower bounds could not be 
calculated for Boger’s statistic, when the number of independent variables was less than 


three. 


C. SCHMIDT’S STATISTIC EXTENDED TO THE AR(1,4) PROCESS 
Since it was desirable to have a statistic which was not limited by the number of 
independent variables, a variation of Schmidt’s statistic to fit the AR(1,4) process was 


investigated. Schmidt was able to show that the statistic d, where 


and 
a 2,4,2 
= ics -€.5)° / per (eqn 3.22) 
cau mag 


provides a test to detect a second order autoregressive disturbance. 
Using Schmidt’s research this paper proposes that it is possible to detect the 
AR(1,4) process by way of the statistic di 4 where: 


dy 4 ee acl (eqn 3.23) 


S| 


Or 


24.2.0 24.2 
a a= Sere DoS: Be Ce) ue (eqn 3.24) 
{72 tz/ e275 cz/ 


In matrix notation this statistic 1s given as 


eA, ge/ee (eqn 3.25) 
where 
Al 4 = Ay = Ay (eqn Sy4iay) 


or Ay gq equals: 


2 =1 00 OF > Ue ae - 0 
eee ee NG 

O -l 3 =) O90 =] \cCmeay 

O 0 -l 3 =O - 1 0m) 

-l 0 0 -) 4) -l) SOS le 
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Results established by Imhoff and Durbin and Watson were used to determine 
upper and lower bounds for the statistic dy 4: Koerts and Abrahamse [Ref. 8] have 
provided Fortran 66 code which allows the determination of the distribution of a 
quadratic form in normal variables by way of Imhoff's distribution. This code was 
updated to Fortran 77 standards and modified to allow the calculation of the di 4 
Statistic. The algorithm utilizes Imhoffs distribution to determine five percent 
significance points of the statistic by successively halving the range of the function 


until 


E(d) a —e05; (eqn 3.27) 
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Where F(d; ) is the Imhoff distribution function as defined in the previous chapter. 
Truncation and integration errors are controlled by input parameters. It was found 
that if these errors were set to less than .0001, the program required inordinate 
amounts of CPU time. At a .0001 level of accuracy, the five percent significance points 
utilized $2500 of computer resources. At a .00001 level of accuracy, approximately a 
six fold increase in resources can be expected. Five percent significance points for the 
statistic dig are listed in Table I. In Table I, K is the number of regressors including 
the intercept term and T is the number of observations. 

Currently, if an AR(1,4) process is suspected to exist in the data a two-step 
procedure is generally used. Specifically, this involves first testing and correcting for 
the AR(1) component of the disturbance and then doing the same for the AR(4) 
component. The AR(1,4) procedure proposes to test and correct the AR(1,4) 
disturbance in one step where both components are simultaneously corrected. Once 
autocorrelation has been determined to exist within the data for both procedures, 
values for the autocorrelation coefficients p; and Pg must be determined. In the 


two-step procedure, py is calculated by way of the formula 
Py = |- Od), (eqn 3525) 


Where d, is the Durbin-Watson statistic. After an appropriate transformation, the data 
are tested in a sequential manner as discussed in the previous chapter until a higher 
order disturbance 1s detected. If a fourth order disturbance is detected, py is estimated 


by the formula 
pa = Wee 55d, (eqns) 


Where dy is the statistic tabulated by Wallis. As can be seen by this procedure, the 
calculation of values for p, and Pg are in a sense conditional upon each other. That ts 
since ,‘s value is used as part of the original data transformation and Pg 1s determined 
by way of the Wallis statistic which is calculated from the transformed data, the value 
of py depends on the value of py. 

There are two important assumptions concerning the two-step procedure as it 
relates to the AR(1,4) disturbance. The first of these is that the Durbin-Watson 
statistic will be used to detect the AR(1,4) process. That is, that an individual who 1s 
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testing for the AR(1,4) process will not calculate the Wallis statistic if the 
Durbin-Watson statistic fails to detect. This assumption should be fairly accurate in 
practice as an individual would most likely decide that autocorrelation is not present if 
the Durbin-Watson statistic indicated such. The second assumption which is closely 
related to the first, concerns the order of calculation for p; and Pg. Specifically, as 
previously shown, the formula derived from the Schmidt statistic contains a p, and Py 
term. Since this is the case, different results might have been obtained if py had been 
calculated by way of the Wallis statistic formula first, after which p; would have been 
calculated by the Schmidt statistic formula. This possibility was not investigated in 
this paper, but p,; was calculated by way of Durbin and Watson’s statistic after which 
P4 was determined from the Schmidt formula. 

For the AR(1,4) procedure values for p, and Pg will be calculated in a similar 


manner. Specifically, it is proposed that p, will be calculated by the formula 
Py = |- Sd} {eqn 3.30) 
and Py by the formula 


Pq =?2- Py - Sd 4: {eqn 3319 


The formula for pg is derived from the original definition of dy g or 


e a 
dig =(D (e,-e,.1)7 + ¥ (ee, 4)2) Dyer (eqn 3.32) 
#32 ees wey 

If the formula for d, 4 is factored and asymptotic arguments applied, after appropriate 
rearrangement of can the formula for py is achieved. The validity of these formulas 
is argued in a manner similar to that for the two-step procedure. The difference is that 
values for p, and pg are determined in a single step. An advantage to this method is 
that a value for Pg is calculated prior to the data transformation, thus its value will not 
be influenced by any peculiarities which may have occurred during the data 
transformation. 

Once estimates for p; and py have been determined, a method to utilize them in 
correcting the AR(1,4) disturbance must now be determined. As previously discussed, 


this requires that a matrix P be found such that P’P = yl where yi! is the inverse of 
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the sample covariance matrix. Once the P matrix is determined, the data can be 


transformed such that 


* 


hee = Pe (eqn 3.33) 
* 

X = Px, (eqn 3.34) 
* 

e = Pe, (edn: 3.35) 


and least squares applied to the model 
*k * * 

Y= ec: , (eqn 3.36) 
where the estimator for f is 

f. 

ie=" 0 PPX) XP PY. (eqn 3.37) 

A. . 
If B is determined from the transformed data, it should be more efficient than estimates 
from ordinary least squares. 


Beach and MacKinnon [Ref. 11], in their paper on maximum likelihood 


estimation of a second order process, defined the P matrix as 


a 
b c¢ 
“P2 -Py | 


a 


They proposed a secondary method to determine values for a, b and c where P’P is 
constrained to be proportional to the inverse of the variance-covariance matrix as 
given be Siddiqui [Ref. 10] and a, b and c are solved for in the implied restrictions. In 
a manner similar to that proposed by Beach and MacKinnon, this paper will solve for 
the P matrix of the fourth order process. 

From Siddiqui’s paper on the inversion of the sample covariance matrix it was 


possible to determine that the inverse of the sample covariance matrix, wil, equals 


1 “*) 0 0 “a, 0 : 0 
“a, lta,” “ay 0 414, “a, 0 0 
0 -*) 1ta,° “3, 0 2,3, -a, 0 0 
0 0 “a, Lea,” "a, 0 217, “a, 0 
7a, 21%, 0 “a, lea, “+a? “a, 0 213, “a, 0 
0 “a, 
0 0 
0 0 
0 “a, 
0 ay 0 0 “ay 1 


Where a; =p, and ag=py. This matrix was obtained from the wi! matrix for a fourth 
order process after setting the terms for ay and a3 equal to zero, where ay=p> and 
a = p3- 

The P matrix for the AR(1,4) process is defined such that P equals: 
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By following Siddiqui’s method in the context as proposed by Beach and MacKinnon, 


10 equations in 10 unknown are defined 


a’ + be + d* + g? = |- aes (eqn 3.38) 
Oe dem ch = -, fe dite 3.) 
fd jo = 0); (eqn 3.40) 
gi = -P Py (eqn 3.41) 
c+ ett he = 1+ De - ‘ (eqn 3.42) 
foes 2 -D (eqn 3.43) 


S20 


ja SU (eqn 3.44) 


2+ i2= 1+ p,*-p,2, (eqn 3.45) 
1} — “Py: (eqn 3.46) 
eT ope (eqn 3.47) 


Solving these equations for the 10 unknowns, the elements of the previously defined P 


matrix are determined where 


a= J (1- pg? - b2- d2- gy, (eqn 3.48) 
b = (-p, - gh -de) /¢, (eqn 3.49) 
c= (1 + py2- pg2-h?- ey (eqn 3.50) 
d= (-py2pgv(1 - pg2)/((1 - pg) V (Cl = pq2)? = 9y242)), (eqn 3.51) 
© = (ppv (1 = p2))/ CV (l= Dae)? - Py29a2)), (eqn 3.52) 
f= J (1 - pq)” - py7p47) / (1 - pa), (eqn 3.53) 


40 


meP Pal J (1- ye (eqn 3.54) 


ga 
| 


h = QO, (eqn 3.55) 


MS pe Caney (eqn 3.56) 
me be pe). (eqn 3.57) 


The previously defined matrix with substituted values as calculated above satisfies the 
required property that P’P = wl, 
Dp. BEFECTIVENESS OF THE AR(1,4) PROCEDURE 
As with any new procedure, it is important that its effectiveness relative to older 

procedures be determined. In this case the AR(1,4) procedure will be compared to the 
two-step procedure. Specifically, these procedures shall be compared in three areas: 

¢ Accuracy of estimate for py, 

e Ability to detect the AR(1,4) disturbance 

¢ Accuracy of estimates for regression parameters 

Two simulation programs written in APL were utilized to generate data for the 

analysis required to make the comparison. These programs are displayed in the 
appendix. Program Stats is used to generate data for the comparison of the estimates 
of py and the ability to detect the AR(1,4) disturbance. This program utilizes the APL 
programs Norrand and Unirand for the generation of normally distributed random 
numbers. Norrand and Unirand are generic APL random number generators on the 
IBM 370/3033AP computer system which was used to perform the simulations. After 
the program generates regression residuals, the Durbin-Watson, Wallis and Schmidt 
Statistics are calculated. The Durbin-Watson and Schmidt statistics are then compared 
to their respective bounds for the selected number of observations and independent 
variables. If the statistic detects the disturbance, a counter will be incremented. In 
addition to determining whether a detection is made or not, the statistics are used in 


previously discussed formulas to calculate values for p; and py. 
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Program Regress is the second simulation used. Its specific purpose is to 
generate data which will allow the comparison of regression parameters from both the 
two-step and AR(1,4) procedures. This program as does the previous, utilizes the 
Norrand and Unirand routines to generate normally distributed random numbers. In 
the program, values for one independent variable and intercept term were generated for 
a predetermined number of observations, from a normal distribution with zero mean 
and a variance of .0625. After generating the error term from the AR(1,4) model for 
specified values of p, and py, the dependent variables are determined by way of the 
general linear model with a value for B of one. Using this data, estimates of B are 
determined by way of ordinary least squares, the two-step procedure and the AR(1,4) 
procedure. 

As previously discussed in Chapter I, it was decided to determine how well the 
AR(1,4) procedure performs relative to the two-step procedure for a best case and 
worst case scenario. For the best case scenario, the number of observations was 100 
and the error term was generated from a normal distribution with zero mean and a 
variance of .O1. For the worst case the number of observations was 20 and the 
variance of the normal distribution was 10. For both the best and worst case 
scenarios, 200 replications were performed for 17 combinations of p, and Pg. The 
number of combinations which could be investigated was limited by the initial terms of 
the P matrix which contained square roots. The problem was that for certain 
combinations of p, and py the term under the square root became negative. The 17 


combinations of p, and pg which were investigated were 


1 p, =. Py = .l 


2p, =-l pg= 3 
> Pi = -aaae7 ae 
f Py ae ee 
> py =e Pa ae 
Opp =3 Pg=.l 
1 Py = Spa as 
SPs 2 pa 
oP) = see ee 
10> Pi =o eS 
ll py =.) pax .3 
[2p | =P 
Is Pp) > 7 pa 


42 


Peim! pa= 2 
ep) Py =e 
rope 29 py =! 
fete Paso 


For a given value of p), any value of py greater than the displayed values for Py 
caused the problem with terms under square roots in the P matrix. In subsequent 
graphs used to display results, each combination of Pp, and py will be referred to by the 
number as assigned above. Thus combination one is given where DS ealleetiael tS 
.l. The effectiveness of the procedure in handling negative autocorrelation was not 
investigated. 

Mean square error (MSE) was selected to be the measure by which the best 
procedure would be selected. In the comparison of ®'s from both procedures, MSE 


was estimated separately for the slope and intercept terms utilizing the formula 
ON 
MSE = ¥ (PB - u)*/n (eqn 3.58) 


where u was the actual value of the slope or intercept and n was the number of 
simulation replications. Specifically, the value of u was unity for this application. Ina 
i aN ash: 

similar manner, the MSE for the vector § consisting of the slope and intercept term 


was determined via the formula 


MSE = ¥ (By - 1X8, - )2/n (eqn 3.59) 


where Bo is the intercept term and By is the slope term. Mean square error for the 
comparison of py’s was calculated in a similar manner. 

Power curves were constructed to allow the determination of which procedure 
could best detect the AR(1,4) disturbance. These curves were constructed by plotting 
for each p,and py combination the number of times each procedure detected the 
disturbance out of 100 possible attempts. As previously mentioned, if a trial turned 
out to be inconclusive, it was not counted as a detection. The number of times for 
each P,.Py combination that the procedures were inconclusive was plotted to enable 


readers who consider inconclusive trials as detections to reevaluate the power curves. 
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E.. SRESUETS 

Comparisons of mean square error for estimates of pq are shown in Figures 3.1 
and 3.2. Figure 3.1 displays mean square errors for the worst case scenario while 
Figure 3.2 displays MSEs for the best case situation. Mean squared errors for both the 
best and worst case scenario are consistent for both the AR(1,4) and two-step 
procedure estimates. For the best case scenario the two-step estimate 1s clearly 
superior for P;,P4 combinations greater than nine. MSEs for two-step and AR(1,4) 
procedure estimators for the worst case scenario diverge for p),P4 combinations greater 
than I5. This indicates that for the worst case scenario, the AR(1,4) procedure 
estimate for Pq can be expected to perform as well as the two-step estimate. 

As previously discussed, an individual using the two-step procedure to test for an 
AR(1,4) disturbance might initially use the Durbin-Watson statistic. If the test 
indicated that autocorrelation did not exist, further testing with the Wallis statistic 
might not be considered. Since this was the case assumed, power curves were created 
which compared the Schmidt and Durbin-Watson statistics. The Schmidt statistic 1s 
considered as part of the AR(1,4) procedure while the Durbin-Watson is part of the 
two-step procedure. For the best case scenario with T = 100 and (7 = .O1, shown in 
Figure 3.4, the Schmidt statistic except for the sixth combination 1s more powerful than 
the Durbin- Watson statistic. For this scenario, both statistics are equally powerful for 
P},P4 combinations greater than 10. For the worst case situation, shown in Figure 3.3, 
both statistics are equally capable across all combinations in detecting the AR(1,4) 
disturbance. As previously discussed the power curves for both procedures do not 
include inconclusive test results. Figures 3.5 and 3.6 display the number of times that 
the test statistics were inconclusive for the worst and best case scenarios respectively. 

Figures 3.7 through 3.12 show comparisons of mean square errors for the 
regression parameters of the two-step and AR(1,4) procedures. Figures 3.7 and 3.8 
show MSEs calculated using both By and By for worst and best case situations, 
respectively. For the worst case with T = 20, 17 = 10 scenario, shown in Figure 3.7, 
MSEs for both procedures follow similar trends which are consistent with each other. 
Figures 3.9 and 3.10 which display MSEs for the intercept and slope coefficients, 
respectively, show trends which support the previous findings of Figure 3.7. Mean 
square errors for the best case scenario are shown in Figures 3.8, 3.11 and 3.12. As 
shown in Figure 3.8, both procedures follow similar trends for P).P4 combinations up 


to 15. Clearly the two-step procedure diverges at the py,Pq combination 15. Figures 
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3.11 and 3.12 indicate that this divergence is related to a divergence of the intercept 
estimate for the two-step procedure. Overall, it appears that regression parameters 
from both procedures are consistent with each other and that one procedure is not 


superior to the other with regards to the estimation of regression parameters. 


F. CONCLUSIONS AND RECOMMENDATIONS 

Overall, the AR(1,4) procedure as previously developed is just as capable as the 
two-step procedure in correcting the AR(1,4) disturbance for the p,,p4 combinations 
for which it can be used. The Schmidt statistic as part of this procedure 1s effective in 
detecting the AR(1,4) disturbance. In certain situations, its abilities surpass those of 
the Durbin-Watson statistic from the two-step procedure. In particular, its capabilities 


are noteworthy at higher values of T, lower values of 1? 


and lower p;,p4 combinations. 
It appears from the results that estimates of py calculated via the two-step procedure 
are better than estimates from the AR(1,4) procedure, at least for the order of 
calculation of pg which was tested. If an individual suspects that an AR(1,4) 
disturbance is present in the data then it is recommended that the Schmidt, 
Durbin-Watson, and Wallis statistics be calculated. The Schmidt statistic should be 
utilized to detect the disturbance while the Durbin-Watson and Wallis statistics are 
used to calculate values of py and py. If the values for p, and Py are acceptable, then 
they should be used in the AR(1,4) P matrix to correct for the disturbance. If the py 
and py values fall out of the acceptable range for the AR(1,4) procedure then the 


two-step procedure must be used. 


G. AREAS OF FUTURE RESEARCEHI 

Three areas of future research are suggested as a result of this research. The first 
of these would be to find a P matrix for the AR(1,4) procedure which was not limited 
by values of p,; and py. A second area of research would be to derive a correction 
procedure for a general fourth order autoregressive process. This derivation would 
follow closely what was done above for the AR(1,4) process. 

A final area of research would be to investigate the maximum likelihood 
estimation of a general AR(4) or an AR(1,4) process. Beach and MacKinnon [Ref. 11] 
have investigated the second order process and suggest possible ways to extend this 


procedure to the general fourth order case. 
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Program Stats is an APL simulation which was used to generate data for the 


comparison of the estimates of py and the ability of the Schmidt statistic as part of the 


AR(I 


,4) procedure and the Durbin-Watson statistic as part of the two-step procedure 


to detect the AR(1,4) disturbance. 


fae 

C2] 

i 

C4 

ey) 

C6] 

C7] 

8] 

C9] 

[10] 
C11] 
[12] 
C13] 
C14] 
Cali 
C16] 
C17] 
[18] 
C19] 
i200 
[21] 
(200 
[23] 
C24] 
[25] 
[26] 
E278 
230 
[29] 
[30] 


V STATS;AD13;AD;AW1B8B;AW1;3:AW23;AW3;AW4B;3AW4 3AW53AW3AS3 PW 

A ENTER VALUES FOR RH01 AND RHO 

'ENTER A VALUE FOR RHO1'! 

RHO1i<0 

'ENTER A VALUE FOR RHO! 

RHO¥<O 

"ENTER NUMBER OF OBSERVATIONS '! 

NOBS<U 

'ENTER VARIANCE OF NORMAL DISTRIBUTION! 
SIG2<0 

A INITIALIZE BOUND FOR STATISTICAL TESTS 
DWL<1.65 

DWU<1.69 

DWAL<1.59 

DWAU<1.63 

DSMTL<3 .45 

DSMIG<e3 vos 

QA INITIALIZE OUTER LOOP VECTORS AND COUNTERS 
RHOAV1<4000 

RHOAV¥S<40p00 

RHOAV4<4000 

R1<4000 

R4S<4000 

R4<4000 

DWCNT<0 

DSTCNTI<0 

DWACNT<0 

DWINCT<0 

SLINCL<O 

DWAICT+<0 

A GENERATE DURBIN-WATSON 'A' MATRIX 
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[31] NO<NOBS 

[32] AD1B<(((NO-2)p0), 1,2, 1)),((NO-2)p0),°1,1 
[33] AD1i<1, 1,(((NOx(NO-2))+(NO-2))pAD1B 

[34] AD<«(NO,NO)pAD1 

[35] a GENERATE THE WALLIS 'A' MATRIX 

[36] AW1B<(((NO-5)-3)p0) 

Borne (< (NO-5 4450 ),1,0,0,0,°1)),AW1B) 
[38] AW1<1,0,0,0, 1,(((€(3xN0O)+3)pAW1C 

[39] AW2<(((NO-8 )xNO)-9)p(((NO-8)p0), 1,0,0,0,2,0,0,0, 1) 
Eee AW34°1,0,0,0,2,0,0,0, 1,AW2 

C41] AWUB<(((3xNO )+3))p (CC (NO-5)+1)p0), 1,0,0,0,1) 
pi ante COCVO—5 )-3 00), 1.,.0.0,0,1,AVUB 

C43] AW5<AW1,AW3,AW4 

C44] AW<«(NO,NO)pAWS 

[45] a GENERATE THE SCHMIDT 'A' MATRIX 

[46] AS<+AD+AW 

[47] a OUTER LOOP FOR REPLICATION 

[48] COUNTO<0 

[49] OUTER: COUNTO<COUNTO+1 

[50] a INITIALIZE INNER LOOP VECTORS 

[51] RHOHAT1<10p0 

[52] RHOHATYS<10p0 

[53] RHOHATY4<10p0 

[54] a INNER LOOP TO CALCULATE RHO1, RHOU AND DETERMINE DETECTION 
[55] COUNTI<0 

[56] INNER: COUNTI<COUNTI+1 

[57] DETDW<0 

[58] DETSMT+<0 

[59] DETWA<0 

[60] INSTCT<0 

[61] INDWCT+<0 

[62] INDWAT+0 

[63] a GENERATE THE RANDOM ERROR 

[64] V+NOBS NORRAND(0,SIG2) 

[65] E<+NOBSpO0 
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C66] N<4 

[67] a CALCULATE CORRECTION TERMS 

[68] J<(1-(RHOU*2))x0.5 

[69] I<«-(RHO1:/7) 

C70] G<RHOUXI 

C71] F<( 14+ (RECT R22) 0 
C72) £l<-CReCr 

[73] De=(ixC)tF 

C74] CHCl +CRRO1* 20 —CRROS x2) (E20. 
[75] Bi<-(RHO1-(DxE1))+#C 

[76] A<(1~- (CRHOY*2 )- (6x2 )-(Dx2 )-(B1*2))*0.5 
[77] a CALCULATE THE FIRST FOUR ERROR TERMS 
[7oi 2 iy ied 

C79] EL21<(V021]-(81xE011))+#C 

[80] FC3]<(V(3]-(DxE([1] )-(£1xE[2]))+F 

C81] EC4¥l<(VC4)]-(GxEC1) )-(IxEC3) ) td 

C82] MM:N<N+1 

[83] ECN]<(RHO1xE(N-1] )+(RHOUXECN-4] )+V(N] 
[84] +(N<NOBS)/MM 

[85] E<(NOBS,1)pE 

[86] a GENERATE THE INDEPENDENT VARIABLES 
[87] X1<(NOBS,1)p(NOBSp1) 

[88] X2<(NOBS,1)p(NOBS NORRAND 0 0.0625) 
C89] X<X1,X2 

[90] a GENERATE THE TRUE BETA 

[91] BETAT<21p11 

[92] a GENERATE THE DEPENDENT VARIABLES 

C93] Y<(X+.xBETAT )+E 

[94] a LEAST SQUARES ESTIMATE OF BETA 

[95] B<YBXx 

[96] a GENERATE THE RESIDUALS 

(97] EHAT<Y-(X+.xB) 

[98] a CALCULATE THE DURBIN-WATSON STATISTIC 
C99] DW<, ((QEHAT )+.x(AD+.xEHAT ) )+( (QEHAT )+.xEHAT ) 
[100] a CALCULATE THE SCHMIDT STATISTIC 
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C101] DS<, ((QEHAT )+.x(AS+.xEHAT) )+( (QEHAT )+.xEHAT) 
[102] a COMPARE TABLED VALUE FOR DURBIN-WATSON STATISTIC TO 
[103] a THE ABOVE CALCULATED VALUE FOR THE DURBIN-WATSON 
[104] DETDW<(DWSDWL ) 

[105] DWCNT<DWCNT+DETDW 

C106] INDWCT<( (DWL<DW)A(DWU>DW) ) 

C107] DWINCT<DWINCT+INDWCT 

C108] a COMPARE TABLED VALUE FOR SCHMIDT STATISTIC 

C109] a TO ABOVE CALCULATED VALUE FOR SCHMIDT STATISTIC 
[110] DETSMT<(DSSDSMTL) 

C111] DSTCNT<DSTCNT+DETSMT 

[112] INSTCT<((DSMTL<DS)A(DSMTU>DS)) 

[113] STINCT<STINCT+tINSTCT 

[114] a CALCULATE THE DURBIN-WATSON ESTIMATE FOR RHO1 
[115] RHOHAT1C(COUNTI1<1-(DW+t2) 

[116] a CALCULATE THE SCHMIDT ESTIMATE FOR RHO 

[117] RH4S1<2-RHOHATILCOUNTI] 

[118] RA4S2<DS+2 

[119] RHOHATYS(COUNTI1<RH4S1-RHYNS2 

[120] a GENERATE TRANSFORM MATRIX FOR FIRST ORDER PROCESS 
[121] PD<,(1-(RHOHATICCOUNTI]*2))x0.5 

[122] PW<((NOXNO)-1)p(((NO-1)p0), (-RHOHAT1(COUNTI]),1) 
[123] PDW<(NO,NO)pPD,PW 

[124] a TRANSFORM RESIDUALS 

[125] ESTAR<PDW+.xEHAT 

[126] a CALCULATE THE WALLIS STATISTIC 

[127] DWA<, ( (RESTAR )+.x(AW+.xESTAR) )+ ( (RESTAR )+.xESTAR) 
[128] a COMPARE TABLED VALUE OF WALLIS STATISTIC 

[1291 Ae2eCALGYLATED VALUE OF WALLIS STATISTIC 

[130] DETWA<(DWASDWAL ) 

[131] DWACNT<DWACNT+DETWA 

[132] INDWAT<( (DWAL<DWA)A‘(DWAU>DWA) ) 

[133] DWAICT<DWAICT+INDWAT 

[134] a CALCULATE THE WALLIS ESTIMATE FOR RHO 

C135] RHOHATY (COUNTI]<1- (DWA+2) 
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[136] 
[137] 
ache 
[139] 
Pea 
(141) 
C142] 
(143) 
C144] 
C145] 
C146] 


> (COUNTI<5)/INNER 
RHOAV1CCOUNTO]<(+/RHOHAT1 )+5 
RHOAVUS [COUNTO]<(+/RHOHATHS )+5 
RHOAV4 CCOUNTO]<(+/RHOHATY )#5 
> (COUNTO<40 )/OUTER 
CNTDW<DWCNT 

CNTDST<DSTCNT 

CNTDWA<DWACNT 

R1<RHOAV1 

RUS*+RHOAVYUS 

R4U<RHOAVY 

V 
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Program Regress is an APL simulation which generates data for the comparison 
of the regression parameters from both the two-step and AR(1,4) procedures. 
V REGRESS;AD;AD13;AW1B;AW1;AW23;AW3 3;AW4B3AW43AW53AW3AS 
C1J A ENTER VALUES FOR RHO1 AND RHO 
[2] 'ENTERA VALUE FOR RH01! 
C3] RHO1<D 
C4] ‘ENTER A VALUE FOR RHO! 
Cea) eso hes es, 
[C6] ‘ENTER NUMBER OF OBSERVATIONS! 
C71 NOBS<0 
[C8] ‘ENTER VARIANCE OF NORMAL DISTRIBUTION! 
C9] sSiIG2<0 
C10] DSMTL<3.45 
[11] DSMTU<3.53 
[12] a GENERATE DURBIN-WATSON 'A! MATRIX 
[13] NO<NOBS 
mie toa NO-2)90),11,2, 1)),(CNO-2)p0), 1,1 
[15] AD1<1, 1, (((NOx(NO-2))+(NO-2))pAD1B 
[16] AD<(NO,NO)pAD1 
C17] aA GENERATE THE WALLIS 'A' MATRIX 
Pusan Te<—(((NO=5 )-3 )p0) 
C19] AW1C<((((NO-5)+1)p0),1,0,0,0, 1)),AW1B) 
peewee iel,0,0,0, 1, ((C(3xN0 )+3 )pAWIC 
E21 Ay2< Cae o=ey <0) -9)pCCCNO-s9p0), 1,0,0,0,2,0,0,0, 1) 
(22 ds -910,0,0,2,0,0,0, 1,Av2 
[23] AW4B<(((3xN0)+3) pC CCCNO-5 )+1)p0), 1,0,0,0,1) 
[24] AW4<(((NO-5)-3)p0), 1,0,0,0,1,AW4B 
(25) AW5<AW1 ,AW3 ,AW4 
[26] AW<(NO,NO)pAWS 
[27] a GENERATE THE SCHMIDT 'A' MATRIX 
[28] AS<AD+AW 
[29] a GENERATE TRANSFORM MATRIX FOR FIRST ORDER PROCESS 
[30] NO<NOBS 
[oda eee, Cl-(RH01x2))*0.5 
[32] PW<((NOXNO )-1)p (CC (NO-1)p0),(-RH01),1) 
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[33] PDW<(NO,NO)pPD,PW 

[34] a GENERATE TRANSFORM FOR FOURTH ORDER PROCESS 
C35] 71<(1- (RHOU*2))*0.5 

[36] Pui<71,((NO-1)p0) 

[37] Pu2<0,71,((NO-2)p0) 

[38] P4u3<0,0,71,((NO-3 )p0) 

[39] Puy<0,0,0,71,((NO-4)p0) 

C40] Pu5<((NO-4)xNO )p ((-RHOU),0,0,0,1,¢((NO-4 )p0)) 
C41] P4u<(NO,NO)p (P41,P42,Pu3,P4u4u,Pus) 

[42] a INITIALIZE MATRICES AND VECTORS FOR OUTER LOOP 
C43] BLSIAV<40p0 

[uu] BLSSAV<40p0 

C45] B2SIAV<40p0 

C46] B2SSAV<40p0 

C47] Bi4YIAV<40p0 

C48] BiuSAV<40p0 

[49] DSBCNT<0 

[50] DSACNT<0 

PStivs7 7 eC7 <0 

[52] STANCT<0 

[53] a OUTER LOOP FOR REPLICATION 

[54] COUNTO<0 

[55] OUTER: COUNTO<COUNTO+1 

[56] a INITIALIZE MATRICES AND VECTORS FOR INNER LOOP 
[57] BLSI<5p0 

[58] BLSS<5p0 

[59] B2SI<5p0 

[60] B2SS<5p0 

[61] Biul<5p0 

[62] B14uS<5p0 

[63] a INNER LOOP TO CALCULATE RHO1, RHO AND DETERMINE DETECTION 
[64] COUNTI<0 

[65] INNER: COUNTI<COUNTI+1 

[66] a GENERATE THE RANDOM ERROR 

[67] V<NOBS NORRAND(0,SIG2) 


70 


[68] E<«NOBSpO 

C69] N<4¥ 

[70] a CALCULATE CORRECTION TERMS 

C71] Je (1-(RHOUx2))*0.5 

[72] I<-(RHO1:7) 

[73] G<+RHOUxl 

C74] Fe (it (RHO1*2 )- (RHOU*2 )-(I*2))*0.5 
C75] E1i<-(RHO1:F) 

[76] D<«-(IxG)+F 

C77] C<(1+(RHO1%*2 )-CRHOU*2 )-(E1%*2))*0.5 
C78] Bi<-(RHO1-(DxE1))+C 

C79] A<(1-(CRHOUx*2 )-(Gx2 )- (D*2 )-(B1*2))*0.5 
[80] a DETERMINE TRANSFORM MATRIX FOR 1,4 PROCESS 
[81] P141<A,((NO-1)p0) 

[82] P142<B1,C,((NO-2)p0) 

[83] P143<D,F1,F,((NO-3)p0) 

C8u] Pi4uu<¢,0,7,7,((NO-4)p0) 

C85] P145<((NO-4)xNO )p ((-RHOW),0,0,(-RHO1),1,((NO-4)p0)) 
[86] Pi4¥<(NO,NO)p(P141,P142,P143,P144,P145) 
C87] a8 CALCULATE THE FIRST FOUR ERROR TERMS 
(88] EL1IJ<Vl1ilJ+A 

C89] EC2]<(V(2]-(B1xE[1] ))+#C 

C90] EC3]<(VC31]-(DxE(1] )-(E1xE([2] ))+F 

[91] EC4¥l<(V[4I]-(GxE[1] )- (IxE3] )) td 

[92] MM:N<N+1 

[93] ECN]<(RHO1xE(N-1] )+(RHOUXE(N-4] )+V CN] 
[94] +(N<NOBS)/MM 

[95] E<«(NOBS,1)pE 

[96] a GENERATE THE INDEPENDENT VARIABLES 
[97] X1<(NOBS,1)p(NOBSp1) 

[98] X2<«(NOBS,1)p(NOBS NORRAND 0 0.0625) 
[99] X<«X1,X2 

[100] a GENERATE THE TRUE BETA 

[101] BETAT<«21p11 

[102] a GENERATE THE DEPENDENT VARIABLES 
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[103] Y<(X+.xBETAT )+E 

[104] a LEAST SQUARES ESTIMATE OF BETA 

[105] BLS<YBxX 

[106] BLSICCOUNTII1<BLS([13] 

[107] BLSSCCOUNTII<BLS[23] 

[108] a GENERATE THE RESIDUALS 

[109] BHAT<Y-<(X+.xBLS) 

[110] a USE FIRST ORDER P MATRIX TO TRANSFORM Y ,X, AND EHAT 
[111] a ZRANSFORM Y 

[112] YSTAR1<PDW+.xyY 

[113] a TRANSFORM X 

[114] XSTAR1<PDW+. xX 

[115] a TRANSFORM RESIDUALS 

[116] ESTAR1<PDW+.xEHAT 

[117] a USE FOURTH ORDER P MATRIX TO TRANSFORM YSTAR1,XSTAR1 
[118] a AND ESTAR1 

[119] a TRANSFORM Y 

[120] YSTAR4<Put+.xYSTAR1 

[121] a TRANSFORM X 

[122] XSTAR4<+P4+.xXSTAR1 

[123] a TRANSFORM RESIDUALS 

[124] ESTAR4<+Pu+.xESTAR1 

[125] a CALCULATE VALUE FOR 2-STEP BETA 

[126] B2S<YSTARYEXSTARY 

[127] B2SITCCOUNTI1<B2S(13] 

[128] B2SSCCOUNTII1<B2S[23] 

[129] a CALCULATE THE SCHMIDT STATISTIC BEFORE 
[130] a TRANSFORMATION 

[131] DS1C<( (REHAT )+.xEHAT) 

[132] DS<, ((QEHAT )+.x(AS+.xEHAT ) )+DS1C 

[133] a COMPARE TABLED VALUE FOR SCHMIDT STATISTIC 
[134] a TO ABOVE CALCULATED VALUE FOR SCHMIDT STATISTIC 
[135] DEBSMT<(DSS<SDSMTL) 

[136] a TF DETECTED INCREMENT 1 

[137] DSBCNT<DSBCNT+DEBSMT 
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[138] INSBCT+«( (DSMTL<DS )A(DSMTU>DS)) 

[139] a TF INCONCLUSIVE INCREMENT 1 

[140] STIBCT+«STIBCT+INSBCT 

C141] a USE 1,4 PMATRIX TO TRANSFORM Y,X, AND EHAT 
[142] a TRANSFORM Y 

C143] YSTAR14<P14+.xy 

[144] a TRANSFORM X 

C145] XSTAR14<P14+.xxX 

C146] a CALCULATE VALUE FOR 1,4 BETA 

C147) B1iu<YSTARINEXSTAR14 

C148] Baul CCOUNTI]<B1i4 [13] 

[149] B1usCCOUNTI]<B14[23] 

[150] a CALCULATE RESIDUALS 

[151] ESTAR14<YSTAR14-(XSTAR14+.xB14) 

C152] a CALCULATE THE SCHMIDT STATISTIC AFTER 
[153] a THE TRANSFORMATION 

[154] DS2D<( (QESTAR14 )+.xESTAR14 ) 

[155] DS<, ((QESTAR14 )+.x(AS+.xESTAR14 ) )+DS2D 
[156] A COMPARE TABLED VALUE FOR SCHMIDT STATISTIC 
C157] a TO ABOVE CALCULATED VALUE FOR SCHMIDT STATISTIC 
C158] DETSMA<(DSSDSMTL) 

[159] a IF DETECTED INCREMENT 1 

[160] DSACNT<DSACNT+DETSMA 

[161] INSACT<((DSMTL<DS )A(DSMTU>DS )) 

[162] a IF INCONCLUSIVE INCREMENT 1 

[163] STANCT<STANCT+INSACT 

[164] >+(COUNTI<5)/INNER 

[165] a CALCULATE AVERAGE VALUES FOR BETAS 

[166] BLSIAVLCOUNTO]<(+/BLSI)+5 

[167] BLSSAVCCOUNTO]<(+/BLSS )+5 

[168] B2STAV[COUNTO]<(+/B2SI )+5 

[169] B2SSAV[COUNTO]<(+/B2SS)+5 

C170] BiYIAVCCOUNTOI]<(+/B14I )+t5 

[171] B1iusAVLCOUNTO]<(+/B14S )+#5 

[172] +(COUNTO<40)/OUTER 
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Dale? s)a 
[174] 
Baoy/ Sal 
Pfau 
Gaby 7 
Palys} 
[179] 
[180] 


AF<DSACNT 
BE<DSBCNT 
LSI<BLSIAV 
LSS<BLSSAV 
I2S<B2SIAV 
S2S<B2SSAV 
T14<BiulTAv 
S14<B14USAV 
V 
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